memCUDA: Map Device Memory to Host Memory on GPGPU Platform

نویسندگان

  • Hai Jin
  • Bo Li
  • Ran Zheng
  • Qin Zhang
  • Wenbing Ao
چکیده

The Compute Unified Device Architecture (CUDA) programming environment from NVIDIA is a milestone towards making programming many-core GPUs more flexible to programmers. However, there are still many challenges for programmers when using CUDA. One is how to deal with GPU device memory, and data transfer between host memory and GPU device memory explicitly. In this study, source-to-source compiling and runtime library technologies are used to implement an experimental programming system based on CUDA, called memCUDA, which can automatically map GPU device memory to host memory. With some pragma directive language, programmer can directly use host memory in CUDA kernel functions, during which the tedious and error-prone data transfer and device memory management are shielded from programmer. The performance is also improved with some near-optimal technologies. Experiment results show that memCUDA programs can get similar effect with well-optimized CUDA programs with more compact source code.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Comparison of Asynchronous Transfer Configurations for UHD Game Image Compression with GPGPU

Ultra high definition (UHD) game scenes have caused the memory bandwidth problem. The lossless DPCM-GR based compression algorithm [12] using NVIDIA CUDA(Compute Unified Device Architecture) like general purpose GPU (GPGPU) computing relieves the bandwidth problem without sacrificing image quality, which supports bit parallel pipelining. This paper increases the memory bandwidth efficiency usin...

متن کامل

DVFS Space Exploration in Power Constrained Processing-in-Memory Systems

In order to deliver high performance under stringent power constraints, future systems may include die-stacked memories with processing-in-memory (PIM) cores. Because of their proximity to the memory, PIMs are expected to target applications which require high bandwidth, implying that PIMs do not need the same computational capabilities as traditional host processor and can therefore be impleme...

متن کامل

The effect of exposed to radiofrequencry emitted from jammer on the spatial memory

Introduction: The Widespread range of mobile phone application in life has induced many problems in the societies. This subject has caused jammer router is used in some places. Jammer router blocks signal transfer from base station to the cell phone. In the other hand, the extensive application of electromagnetic field in all aspect of life has increased fears and worries about...

متن کامل

Effect of short time captopril administration on spatial memory in aging rats.

Introduction: The brain renin-angiotensin system (RAS) has been reported having a pathological role in age-related impairment in learning and memory. Therefore, angiotensin converting enzyme inhibitors (ACEi) are expected to have positive effects on memory. Longtime treatment with captopril (an angiotensin converting enzyme inhibitor) significantly attenuates the age-related impairment in learn...

متن کامل

A Dynamic Memory Allocator for heterogeneous platforms

Modern computers are built upon heterogeneous multi-core/many cores architectures (e.g. GPGPU connected to multi-core CPU). Achieving peak performance on these architectures is hard and may require a substantial programming effort. High-level programming patterns, coupled with efficient low-level runtime supports, have been proposed to relieve the programmer from worrying about low-level detail...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010